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Abstract 

Staphylococcus aureus, a Gram-positive bacterium causes a number of devastating human diseases, such as infective 
endocarditis, osteomyelitis, septic arthritis and sepsis. S. aureus SraP, a surface-exposed serine-rich repeat glycoprotein 
(SRRP), is required for the pathogenesis of human infective endocarditis via its ligand-binding region (BR) adhering to 
human platelets. It remains unclear how SraP interacts with human host. Here we report the 2.05 A crystal structure of the 
BR of SraP, revealing an extended rod-like architecture of four discrete modules. The N-terminal legume lectin-like module 
specifically binds to N-acetylneuraminic acid. The second module adopts a (3-grasp fold similar to Ig-binding proteins, 
whereas the last two tandem repetitive modules resemble eukaryotic cadherins but differ in calcium coordination pattern. 
Under the conditions tested, small-angle X-ray scattering and molecular dynamic simulation indicated that the three C- 
terminal modules function as a relatively rigid stem to extend the N-terminal lectin module outwards. Structure-guided 
mutagenesis analyses, in addition to a recently identified trisaccharide ligand of SraP, enabled us to elucidate that SraP 
binding to sialylated receptors promotes S. aureus adhesion to and invasion into host epithelial cells. Our findings have thus 
provided novel structural and functional insights into the SraP-mediated host-pathogen interaction of S. aureus. 
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Introduction 

The serine-rich repeat glycoproteins (SRRPs) are a family of 
adhesins encoded by Gram-positive bacteria that mediate 
attachment to a variety of host cells or bacteria themselves [1], 
SRRPs typically consist of a signal peptide at the N-terminus, a 
short SRR (SRR1, ~50-170 residues), a ligand-binding region 
(BR, -250-500 residues) followed by a much longer SRR (SRR2, 
~400-4000 residues), and a C-terminal LPXTG motif anchoring 
to the cell wall [1]. The BRs of SRRPs from different pathogenic 
bacteria have varying primary sequences and bind to diverse 
targets from carbohydrates to proteins [1]. 

In addition to having highly variable sequences, the BRs from 
different bacteria are composed of distinct modules. The diversity 
of BR modules and combinations contributes to the multiple 
functions of SRRPs. The only four known BR structures to date 
have identified five distinct modules [2-4] . The BR of Streptococcus 
parasanguinis Fapl contains two modules: an N-terminal helical 
module and a C-terminal CnaA module [2], whereas Streptococcus 
gordonii GspB has a BR of three modules: CnaA, Siglec and a 
unique module of unknown function [3] . In addition, the recently 
reported BR structures of the two SRRP paralogs (Srr 1 and Srr2) 



from Streptococcus agalactiae defined two immunoglobulin-fold 
modules, which specifically bind to the host fibrinogen [4]. 
However, the module composition and corresponding molecular 
functions of most BRs remain unknown, which largely impedes the 
understanding of the pathogenesis mechanism of SRRPs. 

S. aureus is a human pathogen that causes a wide range of 
debilitating and life-threatening infections [5]. S. aureus encodes a 
2,271-residue SRRP termed serine-rich adhesin for binding to 
platelets (SraP), that is involved in the pathogenesis of infective 
endocarditis [6]. Moreover, the BR (residues Phe245-Asn751) of 
SraP, termed SraPgR, mediates intraspecies interaction and 
promotes bacterial aggregation [7]. We determine the 2.05 A 
crystal structure of SraPaR, revealing a rod-like tandem organi- 
zation of four discrete modules: a legume lectin-like module, a 
module with a fi-grasp fold, and two tandem cadherin-like 
modules that create the rigid stem of SraP BR . Further structural 
and biochemical analyses reveal that the legume lectin-like module 
specifically binds to N-acetylneuraminic acid (Neu5Ac), which 
may mediate adhesion to host sialylated receptors. These findings 
increase our knowledge of the diverse BR modules of SRRPs, and 
provide structural insights into a novel surface protein that 
mediates interaction of S. aureus with host epithelial cells. 
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Author Summary 

Staphylococcus aureus is an important pathogen that 
causes a range of human diseases, such as infective 
endocarditis, osteomyelitis, septic arthritis and sepsis. The 
increasing resistance of S. aureus to most of the current 
antibiotics emphasizes the need to develop new ap- 
proaches to control staphylococcal infections. As a surface- 
exposed serine-rich repeat glycoprotein (SRRP), S. aureus 
SraP is involved in the pathogenesis of infective endocar- 
ditis via its ligand-binding region (BR) adhering to human 
platelets. However, little is known about how SraP 
interacts with its host receptor(s). Through structural and 
functional analyses of the BR domain, we have discovered 
a specific binding of SraP to N-acetylneuraminic acid 
(Neu5Ac), in agreement with a recent report of the 
trisaccharide ligand of SraP. Further mutagenesis analysis 
showed that SraP binding to Neu5Ac and the trisaccharide 
promotes S. aureus adhesion to and invasion into host 
epithelial cells. These findings increase our knowledge of 
surface protein mediated interaction of S. aureus with host 
epithelial cells. 

Results 

Overall structure of SraP BR 

Each asymmetric unit of the final model at 2.05 A resolution 
contains a single SraPBR molecule of residues Thr251-Asn751. 
The N-terminal residues Phe245-Thr250 are not visible due to 
their poor electron density. SraP BR folds into a slightly bent, rod- 
like structure of 160 A in length that has four discrete modules: a 
head-like N-terminal module followed a stem of three all-R 
modules (Fig. 1A). All modules have a dominant P-strand 
secondary structure. During the model building and refinement 
process, three peaks of electron density at the 24 a level were 
observed in the | Fo \ — \Fc\ Fourier difference map, indicating the 
presence of three metal ions. Atomic absorption spectroscopy 
assigned these metal ions to Ca z+ ; we thus termed them Ca-1, Ca- 
2 and Ca-3 accordingly. The structure also contains a sucrose and 
a 2-(jV-morpholino)-ethanesulfonic acid (MES) molecule, which 
were introduced from the cryoprotectant and crystallization 
buffer, respectively. 

The N-terminal module adopts a jelly-roll fold with a (3- 
sandwich ((3 1 — P 1 7) core architecture of two antiparallel P-sheets 
packed against each other (Fig. IB). Beyond the core structure, two 
ot-helices (ot 1 and 0(2) pack on either side of the lateral of the P- 
sandwich and partially seal the hydrophobic lateral openings 
between the two P-sheets. A molecule of sucrose binds to the 
protruding loops at the distal end of the N-terminal module 
(Fig. IB). A structural similarity search using the DALI server [8] 
revealed that this module is most similar to legume lectins, despite 
sharing a sequence identity of ^20%. The top hits include lectins 
from legume plants such as Pisum sativum (PDB 2BQP) [9] and 
Robinia pseudoacacia (PDB: 1FNY) [10], with a Z-score of 22-23 and 
root mean square deviation (RMSD) of 2.3-2.5 A over ~200 Cot 
atoms. Thus, we termed the N-terminal module L-lectin. 

The second module possesses a ubiquitin-like P-grasp fold (P- 
GF) in the immunoglobulin (Ig)-binding protein superfamily [11]. 
The module contains a twisted four-stranded mixed P-sheet ((3 1 9- 
P18-P23-P21) packed against a two-stranded antiparallel P-sheet 
(P20-P22) and an ot-helix oc3 (Fig. IB). 

The C-terminal tandem repeat of two modules share a sequence 
identity of 56% and a similar overall structure with an RMSD of 
0.81 A over 81 Cot atoms. The two modules are linked in a head- 
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Figure 1. Overall structure of the full-length SraP B R. A) A 

diagram to show the organization of SraP and BR. B) Cartoon 
representation of the overall structure of SraP B R. C) A topology diagram 
of SraP BR . The L-lectin, |3-GF, CDHL-1 and CDHL-2 modules are shown in 
cyan, red, yellow and green, respectively. The sucrose molecule is 
shown as yellow stick and the bound calcium ions are shown as gray 
spheres. The secondary structural elements are labeled sequentially. 
doi:1 0.1 371/joumal.ppat.l 0041 69.g001 

to-tail fashion, indicating duplication of the coding region during 
evolution. Each module consists of a P-sandwich of three P-sheets 
(Fig. IB & 1C). Structural analysis revealed that the two modules 
resemble eukaryotic cadherins of known-structure (PDB 3UBF, Z- 
score 8.6, RMSD 2.3 A over 85 Cot atoms and PDB 4APX, Z- 
score 9.5, RMSD 2.3 A over 87 Cot atoms) [12,13]. Thus the 
tandem cadherin-like (CDHL) modules were termed CDHL-1 and 
CDHL-2, respectively. 

Specific binding of the L-lectin module to 
N-acetylneuraminic acid 

The L-lectin module is structurally similar to legume lectins 
(Fig. 2A), a large family of carbohydrate-binding proteins with 
diverse activities [14]. Legume lectins commonly coordinate a 
Ca 2+ in addition to a transition metal ion, usually Mn + [15]. In 
contrast, the L-lectin module has only a Ca 2+ -binding site. At the 
apex of the L-lectin module, the Ca 2+ (named Ca-1) was 
embedded in a 7-coordinate geometry (Fig. 2B). The seven 
coordinates are from the side-chain oxygen atoms of Asp365 
(bidentate coordination from OSl and 082), Asp382 (082), 
Asn369 (OSl), the main-chain oxygen atom of Tyr367 and two 
water molecules (Watl and Wat2). The two water molecules were 
further stabilized by the main-chain oxygen atoms of Asp330, 
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Figure 2. Binding of the L-lectin module to Neu5Ac. A) Superposition of the L-lectin module against P. sativum pea lectin (PDB 2BQP) and ft. 
pseudoacacia bark lectin (PDB 1 FNZ). The L-lectin module of SraP BR is colored in cyan, with the carbons of sucrose colored in yellow. The pea and bark 
lectins are colored in magenta and green, respectively. B) The sucrose-binding site and the Ca-1 coordination. The |Fo|-|Fc| omit map of sucrose is 
shown as blue mesh and countered at 3.0 a. The sucrose and MES molecules and their interacting residues are shown as sticks and colored in yellow, 
green and cyan, respectively. The water molecules are shown as red spheres and Ca-1 is shown as grey sphere. The polar interactions with sucrose are 
shown as black dotted lines, whereas the coordinate bonds with Ca-1 are indicated by blue dotted lines. C) Comparison of the ligand binding sites of 
L-lectin module (cyan), pea lectin (magenta) and bark lectin (green). Residues binding to the ligand are shown in sticks. D) SPR sensor diagram for the 
eight tested monosaccharides (D-glucose, D-galactose, D-mannose, L-fucose, N-acetyl-D-glucosamine, N-acetylneuraminic acid, D-xylose, and L- 
Rhamnose) against the immobilized L-lectin module. E) The steady-state equilibrium curve of Neu5Ac at a concentration varying from 0.12 to 10 mM. 
The binding affinity of the L-lectin module towards Neu5Ac was calculated by using the 1:1 Langmuir model, the fit of model to the data was 
determined by an % 2 value, and the % 2 is 8.98. F) A docking model of Neu5Ac (green sticks) binding to the L-lectin module (cyan). Polar interactions 
are shown in black dashed lines. 
doi:1 0.1 371 /journal.ppat.1 0041 69.g002 



Ala349, 081 of Asp382 and another water molecule (Wat3) 
(Fig. 2B). 

A molecule of sucrose is fixed by a cluster of loops protruding 
from the apex of the L-lectin module. The sucrose molecule has a 
"bent-back" conformation with the glucose and fructose moieties 
perpendicular to each other (Fig. 2B). The glucose moiety is 
inserted in the pocket, and adopts a conformation nearly parallel 
to the two P-sheet layers. The sugar ring makes hydrophobic 
interactions with Tyr367 and the morpholine ring of MES, 
whereas the hydroxyl groups are stabilized by hydrogen bonds 
with Ala478-Noc, Asp330-OSl, and three water molecules (Wat3- 
5), which are further fixed by residues Asp330, Asn347 and Ala349 
(Fig. 2B). In contrast, the solvent-exposed fructose moiety is bent 
through interactions with MES and two water molecules (Wat6 
and Wat7) (Fig. 2B). The sucrose binding residues, especially the 
stacking residue Tyr367, are structurally conserved in the legume 
lectins (Fig. 2C). 

To identify the favored saccharide of SraP, we first detected the 
binding affinity of the L-lectin module towards eight common 
monosaccharides using the surface plasmon resonance assays. 
Among these monosaccharides, only Neu5Ac bound to the L- 
lectin module (Fig. 2D). In consequence, we determined that the 



L-lectin module has an equilibrium dissociation constant (Ka) of 
0.54 mM towards Neu5Ac (Fig. 2E), comparable to previously 
reported values of legume lectins [16]. 

Afterwards, we attempted to obtain the Neu5Ac-complexed 
structure without success. Therefore, we docked Neu5Ac to the 
structure of the L-lectin module with the sucrose-binding pocket as 
the search grid. Neu5Ac was docked at a position overlapping the 
glucose moiety of sucrose, with a shift of ~ 1.5 A towards Ca-1. In 
the model, Neu5Ac is stacked against Tyr367 via hydrophobic 
interactions, and makes direct polar interactions with the side 
chains of Ser293, Asn347, Tyr367 and Asn369 and the Not atoms 
of Gly477 and Ala478 (Fig. 2F). 

Structural similarity of the (3-GF module to Ig-binding 
proteins 

The B-GF module adopts a ubiquitin-like B-grasp fold in the Ig- 
binding superfamily [11]. DALI search [8] suggested the module 
resembles the B 1 domain of mucus-binding protein type 2 repeat 
Mub-R5 from Lactobacillus reuteri [17] (PDB code 3157, Z-score 9.1, 
RMSD 1 .6 A, over 68 Cot atoms), and protein L (PpL) from 
Peptostreptococcus magnus [18] (PDB code 1HEZ, Z-score 4.5, RMSD 
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Figure 3. Superpositions of the p-GF module against A) the Ig-binding proteins of B1 domain of mucus-binding protein type 2 
repeat Mub-R5 from L reuteri (PDB 3I57), and B) that of Protein L (PpL) from P. magnus (PDB 1 HEZ). The |3-GF module, the B1 domain 
and PpL are colored in red, grey and light pink, respectively. 
doi:1 0.1 371 /journal.ppat.1 0041 69.g003 



2.5, over 55 Co( atoms). The Bl domain belongs to a family of Ig- 
binding proteins [19] that have a core structure of an a-helix 
packed against a four-stranded (3-sheet [17,20]. The major 
differences are from the helix 0(3 and the two lateral fi-strands 
((3 1 9 and (521). In addition, the fS-GF module of SraP BR contains 
two extra fi-strands, [320 and |322, which are substituted by loops 
or a-helix extensions in the B 1 domain (Fig. 3). Mub-R5 interacts 
in vitro with a large repertoire of mammalian Ig proteins including 
secretory IgA, whereas PpL binds to the Vl domain of Ig K chain 
[19,21]. Complex structures indicated that formation of a (3-zipper 
is necessary for the binding of PpL to the Vl domain of Ig K chain 
[18] and IgG or IgM [22]. However, the corresponding (3-strands 
P 1 9 and (321 are much shorter in the (3-GF module, which might 
not be capable of forming a (3-zipper. 

The tandem cadherin-like modules create a relatively 
rigid SraP BR stem 

The tandem CDHL modules resemble eukaryotic cadherins in 
a superfamily of calcium-dependent adhesions (Fig. 4A & 4B). 
Unlike the eukaryotic cadherins, each of which coordinates three 
Ca 2+ ions (PDB: 1L3W) [23], CDHL-1 and CDHL-2 binds to Ca- 
2 and Ca-3 with a 7-coordinate geometry, respectively. Ca-2 is 
fixed by Asp573 and Asp601 (bidentate coordination from 081 
and 052), Asn602 (081), Asp645 (052), and the main-chain 
oxygen atom of Lys575 (Fig. 4C), whereas Ca-3 is coordinated to 
Asp661 and Asp690 (bidentate coordination from 081 and 082), 
Asn691 (OSl), Asp734 (082), and the main-chain oxygen atom of 
Thr663 (Fig. 4D). The coordinate bonds between Ca 2+ and 
corresponding residues have a length from 2.2 to 2.6 A. 

The small-angle scattering of X-rays (SAXS) is usually applied 
to address the flexibility and conformational states of biological 
macromolecules in solution [24] . To explore the role of Ca 2+ , we 
used SAXS to compare the overall structure of SraP BR in the 
presence or absence of Ca 2+ . A difference in wide-angle scattering 
curves of SAXS indicated that SraP BR adopts different confor- 
mations with or without Ca 2+ . Compared to the Ca + -free form, 
the envelope of Ca 2+ -bound SraP BR correlates much better with 
the SraP B R crystal structure (Fig. 5A). The larger discrepancies of 
the Ca 2+ -free SraP BR are mainly resulted from the tandem CDHL 
modules, indicating that binding of Ca 2+ makes the tandem 
CDHL modules more rigid, and thereby facilitates the extended 
conformation of SraP BR in solution. 

We also performed molecular dynamics simulations of SraP BR 
in the presence or absence of Ca 2+ . Ca 2+ -bound SraP BR remains 
extended and shows slight conformational changes over the 
simulation time, whereas removal of Ca 2+ caused the curling of 
SraP BR (Fig. 5B). Geometric analysis of the Ca 2+ -coordinating 
residues revealed larger fluctuations at the two Ca 2+ -binding 



junctions of Ca 2+ -free SraP BR , suggesting the importance of Ca 2+ 
for the structural integrity of SraP BR . Moreover, the increased 
RMSD values indicated the two junctions in Ca 2+ -free SraP BR 
undergo dramatic conformational changes. Given the CDHL-2 
modules superimposed, the other three modules adopt a more 
curved conformation and are projected to the opposite side upon 
the loss of Ca 2+ (Fig. 5B). 
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Figure 4. Structural comparison and the calcium coordination 
patterns of the CDHL modules. Structural comparison of A) CDHL-1 
and B) CDHL-2 to the classic cadherin (PDB 3MVS). The CDHL-1 module, 
CDHL-2 module and the classic cadherin and corresponding binding 
calcium ions are shown in yellow, green and red, respectively. The 
coordination patterns of C) Ca-2 and D) Ca-3. The coordinate residues 
are presented in green and yellow sticks, respectively. The coordinate 
bonds are indicated as blue dotted lines. 
doi:1 0.1 371/joumal.ppat.l 0041 69.g004 
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Figure 5. The relative rigid conformation of SraP BR upon binding to calcium. A) The SAXS data and envelops for SraPgR in the presence or 
absence of Ca 2+ . B) The final states of the 20-ns molecular dynamics simulation of SraP B R in the presence (gray) or absence (lightpink) of Ca 2+ , 
compared to the crystal structure of SraP BR with the CDHL-2 modules superimposed. Superposition of C) L-lectin&p-GF, D) (3-GF&CDHL-1 and E) 
CDHL-1&2 to the corresponding modules in the full-length SraP BR structure, respectively. Residues at the interface are shown as sticks, polar 
interactions are indicated as black dotted lines and the coordinate bonds with Ca-2 and Ca-3 are indicated as blue dotted lines. 
doi:1 0.1 371 /journal.ppat.1 0041 69.g005 



The rod-like, four-module structure of SraP BR has three 
junctions with a buried interface of 700, 400 and 220 A 2 , 
respectively (Fig. IB). The interface between L-lectin and (3-GF 
is maintained by several hydrogen bonds including Asn257- 
Glu493, Arg303-Asn519, Arg303-Ser494, Leu341-Asn519, and 



Asn466-Asn562 (Fig. 5G). The second interface is formed by 
residues from |3-GF and CDHL-1 via hydrogen bonds of Lys536- 
Asn602, Gly537-Asn602 and Asp503-Thr604 (Fig. 5D). At the 
third junction, Glu587 and Val588 of CDHL-1 form two 
hydrogen bonds with Asn691 of CDHL-2 (Fig. 5E). The second 
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and third interfaces are relatively small; however, they are more 
stable due to the contribution from coordinate bonds of Ca-2 to 
Asn602 and Ca-3 to Asn691. 

To further investigate the plasticity of these junctions, we 
determined three crystal structures for each of two consecutive 
modules: L-lectin&fj-GF (Phe245-Lys575), fi-GF&CDHL-l 
(Ser494-Thr663), and CDHL-1&2 (Ala576-Asn751). We superim- 
posed each of these three structures over the corresponding two 
modules of the full-length SraP BR structure, always with the C- 
terminal modules aligned. The results revealed slight twisting and/ or 
translation with intermodule angle changes of 9.4°, 1 1.7° and 14.6° 
for the three junctions, respectively (Fig. 5C~E). In detail, residues 
Val496-Gln498 in L-lectin&fj-GF, Phe571&Thr572 in fj- 
GF&CDHL-l, and Asp661-Thr663 in CDHL-1&2 undergo slight 
conformational changes. Except for a short disordered segment at the 
N-terminus of CDHL-1&2 (Fig. 5E) due to the deletion of two Ca-2 
coordinate residues Asp573 and Lys575, we did not find secondary- 
structure change in any module pairs. Moreover, merging these three 
structures via sequential alignment of the same module resulted in a 
total twist of 5.4° along the axis of SraP BR structure (Fig. SI), 
suggesting that the slight interdomain twists of the three junctions are 
randomly occurred and could be canceled out. Together, the results 
indicated that in the presence of calcium, SraP BR adopts a relatively 
rigid rod-like structure under all conditions tested. 

SraP promotes S. aureus adhesion and invasion to A549 
cells through sialylated receptors 

Bacterial attachment and colonization at the surface of host cells 
have been thought to be mediated by specific binding of BRs to 
glycoconjugates [1]. It has recently been reported that SraP binds 
to the salivary agglutinin gp340 via the Neu5Ac moiety of the 
trisaccharide Neu5Aca(2-3)Galf3(l-4)GlcNAc [25]. In fact, gp340 
and homologs are also expressed in lung epithelial cells [26]. To 
determine if the L-lectin module mediates SraP BR adhesion, we 
incubated a monolayer of human lung epithelial A549 cells with 
green fluorescent protein (GFP)-fused SraP B R and individual 
modules. The results indicated that full-length SraP B R and the L- 
lectin module, but not the CDHL-1&2 modules or the fi-GF 
module, specifically adhered to the A549 cells (Fig. 6). Tyr367 is a 
key residue to make hydrophobic interactions with sucrose and the 
docked Neu5Ac, we thus constructed the Y367G mutant proteins 
to test the adhesion to A549 cells. As shown in the CD spectra, the 
mutation of Y367G did not introduce significant structural 
changes to SraP BR or the L-lectin module (Fig. S2). In contrast, 
a Y367G mutation in both the full-length SraP BR and the L-lectin 
module almost completely abolished the adhesion capacity. 
Moreover, the addition of 5 mil Neu5Ac completely inhibited 
the adhesion of SraP BR to the A549 cells (Fig. 6). Quantification of 
the fluorescent signal of three representative frames for each image 
further confirmed that only the full-length SraP BR and the L-lectin 
module are capable of specific binding to A549 cells (Fig. S3). 
These results demonstrate that the adhesion of SraP BR to A549 
cells is mediated by the specific recognition of the L-lectin module 
towards Neu5Ac. 

Furthermore we performed comparative assays of bacterial 
adhesion and invasion to A549 cells using S. aureus strain NCTC 
8325 and an isogenic AsraP mutant. Deletion of sraP resulted in an 
approximately 40% decrease in adhesion, as compared to the 
wild-type (Fig. 7A). As a result the level of invasion was decreased 
by ~50% (Fig. 7B). These results indicate that SraP contributes to 
S. aureus adhesion to and invasion into host cells. 

Together with the recently reported SraP ligand, the trisaccha- 
ride Neu5Acoi(2-3)GalfS(l-4)GlcNAc [25], our results strongly 
suggested that the specific binding of SraP BR to the ligand promotes 



the S. aureus adhesion to host cells. We initially tried to determine the 
complex structure of the trisaccharide with SraP BR or the L-lectin 
module without success. As an alternative, we docked the 
trisaccharide to the structure of the L-lectin module (Fig. 7C). In 
the docking model, the NeuSAc moiety of the trisaccharide adopts 
an almost same position to that of Neu5Ac (Fig. 2F). In addition, the 
moieties of Gal and GlcNAc are stabilized via hydrogen bonds by 
residues Asn374 and Ser371, respectively (Fig. 7C). 

To determine whether the interactions between the L-lectin 
module and the trisaccharide contribute to the SraP BR -mediated 
adhesion, we used neuraminidase, fi-galactosidase or N-acetylglu- 
cosaminidase to differentially remove Neu5Ac, Gal, and GlcNAc 
from the surface of A549 cells (Fig. 7D). Treating A549 cells with 
neuraminidase alone resulted in an approximately 36% decrease in 
the adhesion of S. aureusNCTC 8325. In contrast, the digestion with 
either fi-galactosidase or N-acetylglucosaminidase did not signifi- 
cantly affect the adhesion. Further addition of fi-galactosidase and 
N-acetylglucosaminidase to the neuraminidase-treated A549 cells 
did not significantly lower the adhesion level (Fig. 7D, the 6 th and 7 th 
columns). These results indicated that Neu5Ac is the moiety at the 
non-reducing end of the trisaccharide, and a major receptor to the 
SraP-mediated bacterial adhesion of A549 cells. Consistently, the 
wild-type S. aureus and AsraP mutant showed comparable levels of 
adhesion to the neuraminidase-treated epithelial cells (Fig 7D, the 
4 th and 8 th columns). The adhesion level of the wild-type S. aureus to 
the neuraminidase-treated A549 cells (Fig. 7D, the 4 th column) is 
similar to that of the AsraP mutant to the untreated A549 cells 
(Fig. 7A). We thus concluded that SraP is the major S. aureus adhesin 
that recognizes the sialylated host receptors. 

Discussion 

The specific recognition of the L-lectin module to 
sialylated receptors may be a universal mechanism for 
staphylococcal adhesion to host cells 

Structural analyses combined with epithelial adhesion experi- 
ments demonstrate that SraP plays an important role in mediating 
bacterial adhesion to host cells by recognizing the L-lectin module. 
Legume lectins are a large family of proteins primarily found in the 
seeds of legume plants that have a similar fold but distinct 
carbohydrate-binding specificities [14]. They typically adopt a 
quaternary structure of dimers or tetramers that enhances sugar 
binding specificity or affinity [27]. The architecture of the legume 
lectin fold has also been found in the animal calcium-dependent 
lectin ERGIC-53/MR60, a mannose-binding protein involved in 
the export of soluble glycoproteins from the endoplasmic 
reticulum [28]. We report here the first legume lectin fold- 
containing protein in bacteria. This prokaryotic monomeric L- 
lectin module mediates adhesion to host cells by recognizing 
Neu5Ac, usually the non-reducing terminal residue of glycocon- 
jugates of extracellular receptors. Bioinformatics analysis suggested 
that the L-lectin module might also exist in other proteins in 
Staphylococci and Streptococci (Fig. S4A). Moreover, the residues 
involved in Neu5Ac binding are relatively conserved among these 
putative L-lectins (Fig. S4B). Therefore, we hypothesize that this 
adhesion mechanism mediated by the L-lectin module operates in 
other staphylococcal and streptococcal species. 

The L-lectin module is projected outwards mainly by two 
cadherin-like modules 

To scan the host receptors, the L-lectin module should be 
projected outwards from the bacterial surface. In addition to a 
long region of SRR2 that crosses the bacterial cell wall, SraP has a 
relatively rigid stem of three modules: a fi-GF and two CDHL 
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DAPI FITC Merge DAPI FITC Merge 




Figure 6. Contribution of the L-lectin module to bacterial adhesion to human lung epithelial cells. A549 cells were incubated with GFP- 
fused SraP BR (termed BR for short) and truncations (L-lectin, p-GF and CDHL-1&2) or mutants (BR V367G and L-lectin Y 367G); respectively. GFP is used as a 
negative control. The nuclei were stained with DAPI (blue) and the adhered proteins were detected with anti-GFP mouse IgG, followed by FITC 
conjugated goat anti-mouse IgG (green). 
doi:1 0.1 371 /journal. ppat.1 0041 69.g006 



modules. The classic cadherins in vertebrates bridge the 
intermembrane space between neighboring cells by forming 
Zran,?-adhesive homodimers through membrane-distal extracellular 
domains [29]. The extracellular domain of most cadherins often 
contains three conserved Ca """-coordination sites at the inter- 
domain junction [30]. In contrast, each CDHL module in SraP BR 
binds to only a single Ca + that does not superimpose on any of the 
three Ca 2+ ions in eukaryotic cadherins. Nevertheless, multiple- 
sequence alignments suggested strong conservation among Gram- 
positive bacteria of SraP BR Ca """-binding residues (Fig. 8A). 

Unlike the monomeric form of SraP BR , the recombinant 
CDHL1&2 exists as a dimer in solution as confirmed by size- 
exclusion chromatography and chemical cross-linking assays (Fig. 
S5). The structure revealed that the homodimer of CDHL1&2 
buries an interface area of ~600 A /per subunit, as calculated by 
PISA (http:/ /www.ebi.ac.uk/ msd-srv/ prot_int/cgi-bin/ piserver) 
server [3 1] . The interface is mainly stabilized by polar interactions, 
in contrast to the hydrophobic dimeric interfaces of eukaryotic 
cadherins [29]. CDHL-2 from one subunit packs against the 
junction between CDHL-1 and CDHL-2 from the symmetric 
subunit, and vice versa (Fig. 8B). This dimerization pattern may 
explain the result that SraP mediates intraspecies interaction and 
promotes aggregation of S. aureus ISP479C [7]. However, we did 
not observe significant decrease of aggregation or biofilm 
formation upon the deletion of sraP in S. aureus NCTC 8325. 
The different results might be due to the variation of S. aureus 
strains. Sequence homology search indicated that some surface 
proteins in Gram-positive bacteria contain two or more CDHL 
modules, suggesting that those proteins with multiple CDHL 
modules have a high potential to mediate intraspecies aggregation. 

SraP BR possesses a unique functional modularization 
pattern 

SRRPs have been identified in a variety of Gram-positive 
bacteria and function as virulence factors in a wide spectrum of 
infections [1]. The diversity of these infections (e.g. endocarditis, 



meningitis and pneumonia) correlates with the variability of BRs. 
Furthermore, the few BR ligands that have been identified to date 
range from carbohydrates (such as sialyl-T antigen) [32] to 
proteins (such as keratins) [33,34]. In addition to the five distinct 
modules defined in the four previously reported BR structures [2- 
4], our SraP B R structure identified three types of unique modules. 
Notably, although the fj-GF module resembles the Ig-binding 
proteins, such as Bl domain of Mub-R5 and PpL [19,21], the 
binding of the p-GF module towards human IgG, IgA or IgM 
could not be detected, in agreement with our structural analysis. 
We thus propose that the P-GF module, in addition to the two 
CDHL modules, functions as a relatively rigid stem to project the 
L-lectin module outwards. 

The two structure-known SRRPs, Fapl and GspB, undergo 
significant inter-module angle changes, which are regulated by pH 
and ligand binding, respectively [2,3]. In contrast, SraP BR appears 
to adopt a relatively rigid, rod-like conformation when colonizing 
a host, for the concentration of free Ca 2 """ in the extracellular space 
or blood is stringently maintained at 1.1—1.3 mM [35]. The rigid 
architecture of SraP BR enables the globular L-lectin module to 
extend outwards from the bacterial surface for scanning host 
receptors. This strategy to expose the functional modules has been 
observed for other bacterial adhesins such as the fibrillar antigen 
I/II from S. mutans [36], and the rod-like surface protein SasG 
from S. aureus [37]. 

Materials and Methods 

Cloning, expression, and purification of SraP BR 

The genomic DNA from Staphylococcus aureus NCTC 8325 was 
prepared for gene cloning. The DNA sequences (GeneBank, the 
accession number of YP_501439.1) encoding SraP BR (Phe245- 
Asn751) and other SraP truncates were cloned into pET28a with 
an N-terminal Hisg-tag or pET28a with a C-terminal GFP-tag, 
respectively. The constructs were overexpressed in E. coli strain 
BL21 (Novagen) using LB culture medium (10 g NaCl, 10 g 
Bacto-Tryptone, and 5 g yeast extract per liter). The cells were 
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Wild-type AsraP neuraminidase - .. + + + + 

p-galactosidase - + - - + + - 



N-acetylglucsaminidase - - + - - + - 

Figure 7. Importance of SraP in bacterial adhesion to and invasion into epithelial cells. Contribution of SraP to the A) adhesion and B) 
invasion of S. aureus to A549 cells. Experiments were performed in triplicate. Results of representative experiments are presented as mean ± standard 
deviation. C) A docking model of trisaccharide binding to the L-lectin module. The trisaccharide and its interacting residues are shown as sticks and 
colored in magenta and cyan, respectively. D) Adhesion of the wild-type or AsraP mutant S. aureus to A549 cells pre-treated with neuraminidase, p- 
galactosidase and/or N-acetylglucosaminidase. Experiments were performed in triplicate. The standard error of the mean (SEM) derived from 
triplicate treatments are indicated as bar graph. All data were normalized against bacterial adhesion to mock-treated cells. Statistical analyses were 
performed using a one-way ANOVA. 
doi:1 0.1 371 /journal.ppat.1 0041 69.g007 



grown at 37°C to an OD 600nm of 0.6. Expression of the 
recombinant protein was induced with 0.2 mM isopropyl pVD-1- 
thiogalactopyranoside (IPTG) at 16°C for another 20 hr before 
harvesting. Bacteria were collected by centrifugation at 8,000 Xg 
for 10 min and resuspended in 30 ml lysis buffer (20 mM Tris-Cl, 
pH 8.8, 100 mM NaCl). After sonication for 2.5 min followed by 
centrifugation at 1 2,000 xg for 25 min, the supernatant containing 
the His-tagged protein was collected and loaded onto a Ni-NTA 
column (GE Healthcare) equilibrated with the binding buffer 
(20 mM Tris-Cl, pH 8.0, 100 mM NaCl). The target protein was 
eluted with 300 mM imidazole, and loaded onto a Superdex 200 
column or Superdex 75 column (GE Healthcare; 20 mM Tris-Cl, 
pH 8.0, 100 mM NaCl). The purity of protein was assessed by 
electrophoresis and the protein sample was stored at — 80°C. 

The selenium-Met (SeMet) labeled L-lectin&P-GF protein was 
expressed in E. coli strain B834 (DE3) (Novagen). Transformed 
cells were grown at 37°C in SeMet medium (M9 medium 
with 25 (ig/ml SeMet and the other essential amino acids at 
50 u,g/ml) containing 30 Hg/ml kanamycfn until the OD 600nm 
reached 0.6, and were then induced with 0.2 mM IPTG at 16°C 
for 20 hr. SeMet substituted protein was purified with the same 
procedure as the native protein. 



The oligomerization state of CDHL-1&2 was analyzed by 
Superdex 75 column. The protein markers are bovine serum 
albumin, ovalbumin, chymotrypsinogen A, myoblobulin and 
ribinuclease A, which have a molecular weight of 67, 44, 25, 17 
and 13.7 kDa, respectively (GE Healthcare). 

Crystallization, data collection and processing 

All crystals were grown using the hanging drop vapor diffusion 
method, with a drop of 1 J_ll protein solution mixed with 1 |xl of 
reservoir solution equilibrated against 500 |J,1 of the reservoir 
solution. The proteins for crystallization were concentrated by 
ultrafiltration (Millipore Amicon) to 30, 38, 20 and 20 mg/ml for 
the full-length SraP BR , L-lectin&P-GF, P-GF&CDHL-l and 
CDHL-1&2, respectively. The SeMet substituted L-lectin&P-GF 
protein for crystallization was concentrated to 38 mg/ml. Crystals 
of SraP BR were grown at 28°C, whereas others were grown at 
16°C. Crystals were obtained from 0.8 M (NH 4 ) 2 S0 4 and 0.1 M 
MES, pH 6.0 for SraP BR ; 2.5 M sodium formate, 0.1 M sodium 
acetate pH 4.6 for the native L-lectin&P-GF; 10% PEG 6000, 
0.1 M MES, pH 6.0, and 1.0 M lithium chloride for the SeMet 
substituted L-lectin&P-GF; 2.0 M (NH 4 ) 3 P0 4 , 0.1 M Tris-Cl, 
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Figure 8. Homodimerization of the tandem CDHL modules. A) Multiple-sequence alignment of CDHL modules and their homologs. The Ca - 
binding residues are conserved in Gram-positive bacteria. B) The dimeric structure of CDHL-1 &2. The dimer interface is shown as a CPK model. 
doi:1 0.1 371 /journal.ppat.1 0041 69.g008 



pH8.5 for P-GF&CDHL-l and 1.8 M (NH 4 ) 2 S0 4 , 0.1 M 
HEPES, pH 7.5 for CDHL-1&2. The crystals of SraP BR , |3- 
GF&CDHL-l and CDHL1&2 were transferred to the cryopro- 
tectant with the reservoir solution supplemented with 50% 
sucrose. The cryoprotectant for the crystals of L-lectin&P-GF 
consists of the reservoir solution supplemented with 30% glycerol. 

All crystals in the cryoprotectant were flash-cooled with liquid 
nitrogen prior to X-ray diffraction. Data for a single crystal were 
collected at 1 00 K in a liquid nitrogen stream using beamline 1 7U 
with a Q3 15r CCD (ADSC, MARresearch, Germany) at the 
Shanghai Synchrotron Radiation Facility (SSRF). All diffraction 
data were integrated and scaled with the program HKL2000 [38]. 

Structure determination and refinement 

The crystal structure of L-lectin&P-GF was determined using 
single-wavelength anomalous dispersion (SAD) phasing [39] 
method from a single crystal of SeMet-substituted protein to a 
maximum resolution of 2.10 A. The AutoSol program imple- 
mented in PHENIX [40] was used to locate the selenium atoms 
and calculate the phase, which was further improved with the 
program Buccaneer [41]. Automatic model building was carried 
out using Autobuild in PHENIX. The initial model was refined in 
REFMAC5 [42] and Phenix.refine and rebuilt interactively using 
the program COOT [43]. The model was used as the search 



model against 2.05 A SraPg^ data by molecular replacement 
using Molrep program as part of CCP4i [44] program suite. 
Electron density maps showed clear features of secondary 
structural elements for automatically building the C-terminal 
tandem cadherin-like modules using IPCAS [45]. The structures 
of P-GF&CDHL-l and CDHL-1&2 were determined by molec- 
ular replacement using the corresponding modules in the full- 
length SraPgR structure as the search model. The initial models 
were refined by simulated annealing using Phenix.refine to reduce 
the phase bias. Then the models were refined interactively using 
COOT and REFMAC5 until the R-factor and R-free values 
converged. All final models were evaluated with the programs 
MOLPROBITY [46] and PROCHECK [47]. Crystallographic 
parameters were listed in Table 1 . The | Fo \ - \ Fc | omit map of the 
sucrose molecule contoured at 3 a was calculated by FFT 
implemented in CCP4i. All structure figures were prepared with 
PyMOL (http:/ /www.pymol.org/). 

Determination of metals binding to the protein 

The purified SraP B R, CDHL-1&2, BR Y 367G an d L-lectin Y 367G 
in 20 mM Tris-Cl, pH 8.0, 100 mM NaCl were concentrated to 
30, 38, 30 and 21 mg/ml, respectively, and applied to the 
analyses. Briefly, 500 |ll of protein sample was subjected to 
digestion by the aqueous method using the HN0 3 and HC10 4 
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Table 1. Crystal parameters, data collection and structure refinement. 






BR 


SeMet- L-lectin&p-GF 


P-GF&CDHL-1 


CDHL-1&2 


Data collection 


Space group 


P3 2 21 


C2 


P2, 


P3,21 


Unit cell 


a, b, c (A) 


118.02, 118.02, 136.41; 


135.70, 179.64, 71.51; 


32.98, 46.46, 47.59; 


34.76, 34.76, 237.73; 


a, P, y 0 


90.00, 90.00, 1 20.00 


90.00, 89.77, 90.00 


90.00, 1 02.64, 90.00 


90.00, 90.000, 1 20.00 


Resolution range (A) 


50.00-2.05 (2.15-2.05) 3 


50.00-2.10 (2.18-2.10) 


50.00-1.60 (1.64-1.60) 


50.00-2.24 (2.32-2.24) 


Unique reflections 


69,226 (6,805) 


99,475 (9,904) 


18,581 (1,236) 


8,890 (866) 


Completeness (%) 


100 (100) 


100 (100) 


99.6 (99.1) 


99.6 (96.7) 


<l/a(l)> 


13.8 (5.8) 


14.3 (5.6) 


10.0 (3.7) 


1 6.7 (8.8) 


Rmerge b (%) 


13.7 (38.5) 


15.8 (35.7) 


10.3 (32.3) 


13.1 (38.9) 


Average redundancy 


6.9 (6.8) 


6.9 (6.9) 


3.0 (2.6) 


8.6 (5.7) 


Structure refinement 


Resolution range (A) 


50.00-2.05 


44.91-2.10 


46.44-1.60 


50.00-2.24 


R-factor7R-free d (%) 


18.2/20.5 


15.8/19.5 


16.4/20.2 


27.9/32.6 


Number of protein atoms 


3,675 


9,566 


1,251 


1,181 


Number of water atoms 


618 


1,448 


267 


8 


RMSD e bond lengths (A) 


0.009 


0.011 


0.008 


0.012 


RMSD bond angles (°) 


1.285 


1.337 


1.220 


1.569 


Mean B factors (A 2 ) 


29.5 


21.4 


21.0 


51.9 


Ramachandran plot f (residues, %) 


Most favored (%) 


98.2 


98.8 


98.2 


97.5 


Additional allowed (%) 


1.8 


1.2 


1.8 


2.5 


Outliers (%) 


0 


0 


0 


0 


PDB entry 


4M00 


4M01 


4M02 


4M03 



a The values in parentheses refer to statistics in the highest bin. 

b Rmerge = X hkl £jl' i ^' < ')" ^(hkO^/^hki^j'ithkl), where Ij(hkl) is the intensity of an observation and <l(hkl)> is the mean value for its unique reflection; Summations are 
over all reflections. 

c R-factor =£ h ||Fo(h)|-|Fc(h)||/£ h |Fo(h)|, where |Fo| and |Fc| are the observed and calculated structure-factor amplitudes, respectively. 
d R-free was calculated with 5% of the data excluded from the refinement. 
e Root-mean square-deviation from ideal values. 
'Categories were defined by Molprobity. 
doi:1 0.1 371 /journal.ppat.1 0041 69.t001 



(4:1, v/v) method. Afterwards, the digested samples were diluted 
with deionized water and analyzed by atomic absorption 
spectroscopy (Atomscan Advantage, Thermo Ash Jarell Corpora- 
tion, USA). 

Surface plasmon resonance (SPR) assays 

The binding affinities of the L-lectin module towards varying 
monosaccharides were determined by SPR. SPR experiments were 
performed at 25°C using a Biacore 3000 instrument using HBS 
(10 mM HEPES, pH 7.5, 150 mil NaCl) containing 0.005% (v/v) 
Tween 20 and a flow rate of 5 |il/min. The L-lectin module was 
covalently immobilized on the carboxymethyldextran surface of the 
CM5 chip. The chip was activated with EDC (N-ethyl-N-[3- 
dimethylaminopropyl] carbodi-imide)/NHS (N-hydroxysuccini- 
mide) solution, and the L-lectin module in 10 mM acetate buffer 
(pH 5.5) was injected into the flow channel. At the end, the sensor 
surface was blocked with 1 M ethanolamine. The blank channel 
was treated in the same way without protein injected. Each 
monosaccharide in the running buffer was incubated for 1 min in 
the flow-cells using the kinject mode. Both injection and dissociation 
steps last for 5 min. The sensor surface was regenerated with 
50 mM NaOH. All analyses were performed with the BIAeval 
software. The equilibrium responses were plotted versus monosac- 



charide concentrations and fitted to a 1:1 Langmuir binding model 
using the Origin 8.0 software (OriginLab Corp.). 

Computational docking 

The docking of NeuSAc or the trisaccharide Neu5Acot(2- 
3)Gal(3(l-4)GlcNAc to the L-lectin module of SraP BR was 
performed with AutoDock Vina software (version 1.0) [48], which 
uses a unique algorithm that implements a machine learning 
approach to its scoring function. This docking allowed a 
population of possible conformations and orientations for the 
ligand at the binding site to be obtained. Using AutoDock Tools 
(ADT) 1.5.4 [49], polar hydrogen atoms were added to the L- 
lectin structure, and its non-polar hydrogen atoms were merged. 
The protein and ligands were converted from a PDB format to a 
PDBQT format. All single-bonds within NeuSAc were set to allow 
rotation. A grid box covering the entire sucrose-binding site was 
used to place Neu5Ac freely. The results were sorted by binding 
affinity and visually analyzed using PyMOL. 

Chemical cross-linking 

Chemical cross-linking of purified CDHL-1&2 was performed 
using formaldehyde and bis(sulfosuccinimidyl) suberate (BS 5 ), 
which is a homobifunctional sulfo-N-hydroxysuccinimide ester 
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analog with a spacer arm length of 1.14 nm (Pierce). Briefly, for 
formaldehyde cross-linking assays of CDHL-1&2, 20 u.1 of the 
recombinant protein (2 mg/ml) was mixed with 20 u.1 PBS 
containing 2% formaldehyde, and the samples were incubated at 
25°C for 30 min and 1 hr, respectively. For BS' cross-linking 
assay of CDHL-1&2, 100 u.1 recombinant protein (2 mg/ml) was 
incubated with 5 mil BS J at 25°C for 30 min and 1 hr, 
respectively. The reaction was quenched by the addition of 
20 mM Tris-HCl, pH 8.0. Then the samples were separated by 
10% SDS-PAGE and were stained by Coomassie Brilliant Blue 
G250. Molecular mass markers for SDS-PAGE were purchased 
from Thermo Scientific (Wilmington, DE): beta-galactosidase, 
bovine serum albumin, ovalbumin, lactate dehydrogenase, REase 
Bsp98I, beta-lactoglobulin, lysozyme, which have a molecular 
weight of 1 16, 66.2, 45, 35, 25, 18.4 and 14.4 kDa, respectively. 

Bacterial strains, media and growth conditions 

Staphylococcus aureus NCTC 8325 and its derivative strains were 
grown in LB medium, and when necessary, erythomycin (2.5 mg/ 
ml) and chlorampenicol (15 mg/ml) were added. To generate the 
insertion-deletion mutagenesis, approximately 500 bp of upstream 
and downstream fragments of the BR region of sraP gene was 
amplified by polymerase chain reaction (PCR), digested with Pstl/ 
Sail and Bamlil/ Xbal, respectively, and then ligated to either end 
of a double-digested (Bamlil/ 'SaK] erythromycin-resistance gene 
(Erm R ), which was amplified from plasmid pECl. The three 
fragments were ligated with the erythromycin-resistance gene in 
the middle, and then cloned in the temperature-sensitive shuttle 
vector pBT2. The resulting plasmid was transformed by electro- 
poration into S. aureus strain RN4220 for propagation, and then 
transformed into S. aureus NCTC 8325 for allelic exchange. 
Mutants were screened and further checked by PCR and 
sequencing. 

Cell adhesion and invasion assays 

Adhesion and invasion experiments were performed as 
described previously [50]. A549 human respiratory epithelial cells 
were grown in Dulbecco's modified Eagle's medium (DMEM, 
Gibco) supplemented with 10% fetal bovine serum (FBS), 5 mM 
glutamine, penicillin (5 ug/ml) and streptomycin (100 ug/ml). 
Approximately 5xl0 5 cells were seeded into 24-well tissue culture 
plates, and allowed to grow in 5% CO z at 37°C. Before use, the 
monolayers were washed twice with PBS. For adhesion assays, S. 
aureus grown to OD 600nm of 0.5 in LB medium were resuspended 
in cell culture medium without serum. Bacteria were diluted to a 
concentration of 2x10 CFU/ ml and were used to infect the 
confluent cell monolayers at 37°C for 1 hr. After incubation, the 
infected monolayers were washed five times with PBS to remove 
non-adhered bacteria, and treated with 200 u.1 trypsin (2.5 mg/ml) 
at 37°C for 3 min to release the adhered bacteria. A549 cells were 
lysed with 0.05% Triton X-100. The number of adhered bacteria 
was determined by plating serial dilutions of the recovered 
bacterial suspensions onto LB agar. For the deglycosylation of 
A549 cells, The cells were incubated with DMEM media 
containing 0.008 units ml 1 purified Clostridium perfringens neur- 
aminidase (Sigma), 40 nM purified [S-galactosidase (Spr0565) from 
S. pneumoniae or 0.0025 units ml 1 of S. pneumoniae fS-N-acetylglu- 
cosaminidase (Sigma) at 37°C for 4 hr in 5% CO2 [34]. 

For invasion assays, the bacteria in each well, after incubated in 
0.5 ml DMEM for 1 hr, were incubated in 1 ml fresh DMEM 
containing 14 u.g/ml of gentamicin (Sigma) for another 1 hr. Cell 
monolayers were washed three times with sterile PBS and lysed 
with 0.05% Triton X-100. The internalized bacteria were counted 
by plating serial dilutions of the recovered bacterial suspensions 



onto LB agar. Experiments were performed in triplicate. Data 
corresponding to adhesion and invasion were compared using the 
Mann-Whitney tests. Statistical differences were determined with 
the t-test. 

Immunofluorescence assays 

Immunofluorescence assays were performed as described 
previously [51]. Briefly, GFP and GFP-fused proteins at 10 uM 
were suspended in the media in the absence of serum and 
antibiotics, and incubated on ice for 2 hr with the monolayer of 
A549 cells grown on the 12-mm glass coverslips. For inhibition 
assays, Neu5Ac at 5 and 10 mM was pre-incubated with GFP- 
fused SraP BR protein, respectively. After incubation, cells were 
sequentially washed three times with cold PBS, fixed with 4% 
paraformaldehyde for 10 min, permeabilized with 0. 1 % Triton X- 
1 00 for 2 min, and incubated with GFP-tag mouse antibody and a 
fluorescein isothiocyanate (FITC)-conjugated goat anti-mouse IgG 
antibody. The nuclei were stained with 4',6-diamidino-2-pheny- 
lindole (DAPI) reagent. Slides were examined with a Zeiss 
LSM710 confocal scanning fluorescence microscope (Carl Zeiss, 
Jena, Germany) with a Plan-Apocromat 20x/0.8NA objective. 
Confocal parameters set for immunofluorescence detection were 
taken as standard settings. The excitation wavelength is 405 nm 
and the emission wavelength is 410-492 nm. The confocal images 
were collected and processed with the software ZEN 2009. 
Experiments were performed in triplicate, with three or more 
replicate wells tested for each experimental condition. 

Molecular dynamics simulations 

Molecular dynamics simulations were performed for SraP BR in 
the Ca 2+ -bound and the Ca 2+ -free states, respectively. For each 
simulation, the system was placed in a TIP3P [52] water box with 
a distance of at least 12 A to the box boundaries. Ions were added 
to neutralize the system and to result in a concentration of 0.15 M 
NaCl. The solvated protein was subjected to energy minimization 
employed the steepest descent algorithm and conjugate gradient, 
respectively. Simulations were performed with a parallel imple- 
mentation of the GROMACS (version 4.5.5) package [53] using 
the AMBER03 force field [54]. MD productions were run for 
20 ns using a time step of 2 fs and the NPT ensemble [55]. 
Covalent bonds were constrained using the LINCS algorithm 
[56], while the cutoff distances for the Coulomb and van der 
Waals interactions were set to 0.9 and 1.4 nm, respectively. The 
long-range electrostatic interactions were treated by the PME 
algorithm [57] with a tolerance of 10 -5 and an interpolation order 
of 4. Structure visualization was performed with VMD [58]. 

Small-angle X-ray scattering (SAXS) experiments 

SAXS was used to investigate the overall conformations of 
SraPg R in the presence or absence of calcium. The full-length 
SraPg R at 1.0 and 5.0 mg/ml was analyzed, either in 10 mM 
calcium chloride or in 50 mM EDTA. SAXS data were collected 
on the 12ID beamline of Advanced Photon Sources (APS) at the 
Argonne National Laboratory using the Pilatus 2M detector 
(DECTRIS, Switzerland). The scattering patterns were measured 
with a 1-2 second exposure time for each collected frame, and 
twenty frames were taken for each sample to optimize the signal- 
to-noise ratio. To reduce the radiation damage, a flow cell made of 
a cylindrical quartz capillary with a diameter of 1 .5 mm and a wall 
of 10 urn was used during the data collection process. No 
concentration effect was observed. All SAXS curves were 
measured at the room temperature over the range of momentum 
transfer 0.006<s<0.82 A" 1 (where s = 4ti sin(9)A, 26 is the 
scattering angle, and the X-ray wavelength X is 1 .033 A). The data 
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were processed using the PRIMUS [59] program package and 
standard procedures. The forward scattering (1(0)) and the radius 
of gyration (R^ were evaluated using the Guinier approximation 
assuming that at very small angles (s<1.3/i? i ,) the intensity is 
represented as I(s) = l(0)exp(— (si^,) 2/ ' 5 ). 

The program GNOM [60] was used to calculate D max and the 
interatomic distance distribution function p(r). The particle shape of 
each measured sample was reconstructed ab initio using the programs 
DAMMIN [61] and GASBOR [62]. The scattering patterns of the 
atomic crystal structures for SraP were calculated using the program 
CRYSOL. For the ab initio analyses and modeling, multiple runs 
were performed to verify the stability of the solution. 

Supporting Information 

Figure SI The intermodule twist along the axis of SraP BR . The 
merged structure of SraP BR (in orange) was generated by 
sequentially superimposing the same module against each other 
from the three structures (L-lectin&P-GF, P-GF&CDHL-l and 
CDHL-1&2). The L-lectin, (3-GF, CDHL-1 and CDHL-2 
modules of SraP B R are shown in cyan, red, yellow and green, 
respectively. The merged SraP BR was superimposed against the 
crystal structure of SraP BR with the two CDHL-2 modules aligned. 
(TIF) 

Figure S2 The CD spectra of A) SraP BR and B) the L-lectin 
module. The results demonstrated that mutation of Y367G did not 
introduce significant changes to the protein structures. 
(TIF) 

Figure S3 Quantitation of the GFP fluorescent signals of SraP BR 
and mutants. The fluorescent signals for each protein were 
quantified by calculating the mean gray values of three represen- 
tative frames using the ImageJ software (http:/ /imagej.nih.gov/ij/). 
The average mean gray value and standard error of the mean 
(SEM) derived from triplicate treatments are indicated as bar graph. 
(TIF) 
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(TIF) 
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